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Abstract 

Recent information theoretic results suggest that precoding on the multi-user downlink MIMO channel 
with delayed channel state information at the transmitter (CSIT) could lead to data rates much beyond 
the ones obtained without any CSIT, even in extreme situations when the delayed channel feedback is 
made totally obsolete by a feedback delay exceeding the channel coherence time. This surprising result 
is based on the ideas of interference repetition and alignment which allow the receivers to reconstruct 
information symbols which canceling out the interference completely, making it an optimal scheme in 
the infinite SNR regime. In this paper, we formulate a similar problem, yet at finite SNR. We propose a 
first construction for the precoder which matches the previous results at infinite SNR yet reaches a useful 
trade-off between interference alignment and signal enhancement at finite SNR, allowing for significant 
performance improvements in practical settings. We present two general precoding methods with arbitrary 
number of users by means of virtual MMSE and mutual information optimization, achieving good 
compromise between signal enhancement and interference alignment. Simulation results show substantial 
improvement due to the compromise between those two aspects. 

Index Terms 

Multi-user MIMO, Delayed Feedback, Precoding, Interference Alignment 



I. Introduction 

Multi-user MIMO systems (or their information-theoretic counterparts "MIMO broadcast channels"), have 
recently attracted considerable attention from the research community and industry alike. Success is due to their 
ability to enhance the wireless spectrum efficiency by a factor equal to the number N of antennas installed 
at the base station, with little restriction imposed on the richness of the multipath channel, the presence or 
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absence of a strong line of sight channel component, and the fact it can easily accommodate single antenna 
mobile devices. On the downlink of such systems, the ability to beamform (i.e. linearly precode) multiple data 
streams simultaneously to several users (up to N) comes nevertheless at a price in terms of requiring the base 
station transmitter to be informed of the channel coefficients of all served users [1]. In frequency division duplex 
scenarios (the bulk of available wireless standards today), this implies establishing a feedback link from the 
mobiles to the base station which can carry CSI related information, in quantized format. A common limitation 
of such an approach, perceived by many to be a key hurdle toward a more widespread use of MU-MIMO methods 
in real-life networks, lies in the fact that the feedback information typically arrives back to the transmitter with 
a delay which may cause a severe degradation when comparing the obtained feedback CSIT with the actual 
current channel state information. Pushed to the extreme, and considering a feedback delay with the same order 
of magnitude as the coherence period of the channel, the available CSIT feedback becomes completely obsolete 
(uncorrected with the current true channel information) and, seemingly non exploitable in view of designing the 
precoding coefficients. 

Recently, this commonly accepted viewpoint was challenged by an interesting information-theoretic work 
which established the usefulness of stale channel state information in designing precoders achieving significantly 
better rate performance than what is obtained without any CSIT [2]. The premise in [2] is a time-slotted MIMO 
broadcast channel with a common transmitter serving multiple users and having a delayed version of the correct 
CSIT, where the delay causes the CSIT to be fully uncorrected with the current channel vector information. In 
this situation, it is shown that the transmitter can still exploit the stale channel information: The transmitter tries 
to reproduce the interference generated to the users in the previous time slots, a strategy we refer in this paper 
as interference repetition, while at the same time making sure the forwarded interference occupies a subspace 
of limited dimension, compatible with its cancelation at the user's side, a method commonly referred to as 
interference alignment [3,4]. Building on such ideas, [2] constructs a transmission protocol referred as the 
MAT protocol which was shown to achieve the maximum Degrees-of-Freedom (DoF) for the delayed CSIT 
broadcast MIMO channel. Precoding on delayed CSIT MIMO channels have recently attracted more interesting 
work, dealing with DoF analysis on extended channels, like the X channel and interference channels [5-7], but 
also performance analysis including effects of feedback [8] and training [9]. The DoF is a popular information 
theoretic performance metric indicating the number of interference-free simultaneous data streams which can 
be communicated over this delayed CSIT channel at infinite SNR, also coinciding with the notion of pre-log 
factor in the channel capacity expression. In the example of the two antenna transmitter, two user channel, the 
maximum DoF was shown in [2] to be |, less than the value of 2 which would be obtained with perfect CSIT, 
but strictly larger than the single DoF obtained in the absence of any CSIT. This means that completely obsolete 
channel feedback is actually useful. 

Although fascinating from a conceptual point of view, these results are intrinsically focussed on the asymptotic 
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SNR behavior, leaving aside in particular the question of how shall precoding be done practically using stale 
CSIT at finite SNR. This paper precisely tackles this question. In what follows we obtain the following key 
results: 

« We show finite SNR precoding using delayed CSIT can be achieved using a combination of interference 
repetition, alignment together with a signal enhancement strategy. 

• We propose a precoder construction generalizing the ideas of [2], namely Generalized MAT (GMAT), where 
a compromise between interference alignment and orthogonality within the desired signal channel matrix 
is striken, and generalize it to the scenario with arbitrary number of users. 

• The precoder coefficients are interpreted as beamforming vector coefficients in equivalent interference 
channel scenario, which can be optimized in a number of ways, including using an MMSE metric, and 
mutual information metric. To our best knowledge, the optimization of a finite SNR precoding scheme 
based on delayed feedback has not yet been addressed. 

Numerical evaluation reveal a substantial performance benefit in terms of data rate in the low to moderate SNR 
region, but coinciding with the performance of [2] when the SNR grows to infinity. Note that a preliminary set 
of results were reported recently in [10] for the 2-user case, while this paper provides a generalization to the case 
of arbitrary number of users. 

The rest of the paper is organized as follows. In Section II, the channel model of interest is described and the 
proposed GMAT protocol is detailed first in the 2-user case then is generalized to the If -user case. Section III 
focuses on the precoder optimization method based on MMSE and mutual information criteria. Discussion on 
the multiplexing gain and an interesting interpretation from an equivalent MIMO interference channel is given in 
Section IV. Numerical examples showing the advantages of the new methods are discussed in section V. Finally, 
Section VI concludes the paper. 

Notation: Matrices and vectors are represented as uppercase and lowercase letters, and transpose and conju- 
gate transpose of a matrix are denoted as (-) T and (•) H , respectively. Further, Tr(-), || • || and || • || F represent 
the trace of a matrix, the norm of a vector and a Frobenius norm of a matrix. We reserve [A] m „ to denote the 
element at the m-th row and n-th column of matrix A, and |<S| to the cardinality of the set S. Finally, an order-fc 
message denoted by u s (\S\ = k) refers to a linear combination of k distinct symbol vectors intended to k 
different users in set S. 

II. System Model 

Consider a K-user MU-MIMO downlink system with a transmitter equipped with K antennas and K single- 
antenna users. A time slotted transmission protocol in the downlink direction is considered, where the multi- 
antenna channel vector from the transmitter to i-th user, in the j'-th time slot, is denoted by hf(j) = [hn(j) ■■■ h iK (j)]. 
We denote by x(j) the K x 1 vector of signals sent from the array of K transmit antennas. As in [2], the point 
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made in this paper is that delayed feedback can be of use to the transmitter including the extreme situation where 
a feedback delay of one unit of time creates a full decorrelation with the current downlink channel. For this 
reason, we base ourselves on the framework of so-called delayed CSIT [2, 5-9] by which at time j, it is assumed 
that user-i has perfect knowledge of {hj(t)}j =1 and of the delayed CSIT of other users {h k (t)} J t Zi , k / i, 
while the transmitter are informed perfectly {h^i)}^"*, Vi Furthermore, we make no assumption about any 
correlation between the channel vectors across multiple time slots (could be fully uncorrelated), making it is 
impossible for the transmitter to use classical MU-MIMO precoding to serve the users, since the transmitter 
possesses some CSIT possibly independent from the actual channel. 

Recently, Maddah-Ali and Tse [2] proposed an algorithm under such delayed CSIT setting obtaining DoF 
strictly beyond that obtained without any CSIT, even in extreme situations when the delayed CSIT is made totally 
obsolete. The key ideas lie in interference repetition and alignment. Doing so, the users are able to reconstruct the 
signals overheard in previous slots to allow them to cancel out the interference completely. Particularly, in the 2- 
user case, it is assumed that three time slots are used to send a total of four symbols (two for each user), yielding 
an average rate efficiency of 4/3 symbols/channel use, while in the 3-user case, it delivers total 18 symbols in 1 1 
time slots, providing yf DoFs. Generally speaking, when there are K users, a If -phase transmission protocol is 
proposed achieving the maximum DoF if" j_ . Although such rates are inferior to the ones obtained under 
the full CSIT setting (K symbols/channel use for K antenna system), they are substantially higher than what 
was previously reported for the no CSIT case (1 symbol/channel use regardless of K). 

Although optimal in terms of the DoF, at infinite SNR, we point out that the above approach can be substan- 
tially improved at finite SNR. The key reason is that, at finite SNR, a good scheme will not attempt to use all DoFs 
to eliminate the interference but will try to strike a compromise between interference canceling and enhancing 
the detectability of the desired signal in the presence of noise. Taking into account this property of basic receivers 
leads us to revisit the design of the protocol and in particular the design of the precoding coefficients as function 
of the knowledge of past channel vectors under the name of GMAT. 

First, we proceed by reviewing the proposed protocol in the 2-user case, highlighting the connections with 
the original MAT algorithm. We then generalize the protocol to respectively the 3 and If -user cases. In the next 
section, we then turn to the problem of the optimization of the precoders. 

A. GMAT for the 2-user Case 

Here, we introduce the concept of the GMAT algorithm in the 2-user case. Note that the transmission in the 
first two time slots is identical to the MAT algorithm, with 

x(1) = b a , x(2)=s B (1) 

where x(i) (t = 1, 2) is the 2x1 signal vector sent from the transmitter at time slot t, s A and s B are 2 x 1 symbol 
vectors intended to user A and B, respectively, satisfying E{s;S^} = I. In the third time slot, the transmitter 
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now sends 



x(3) 



U A B 





(2) 



where uab corresponds to an order-2 message (i.e., a combination of two individual user messages in the 
following form) 



uab = wf s A + w^s B 



(3) 



where w x and w 2 are precoding vectors satisfying the power constraint ||wi|| 2 + ||w 2 || 2 < 2 and can be a 
function of hj(l) and hi (2) according to the delayed CSIT model. Note that this power constraint balances the 
transmit power used over three time slots. The signal vector received over the three time slots at user A is given 
by: 



[P- 



(4) 



where = [?/a(1) ?/a(2) J/a(3)] t is the concatenated received signal vector at user A in overall three time 
slots, = n A (2) nA{3)\ T is the Gaussian noise vector with zero-mean and unit-variance, P is the total 

transmit power in each time slot, and the effective signal and interference channel matrices are 



(5) 












H^i — 





, H^ 2 — 


h^(2) 




7iAi(3)wf 




/i A i(3)w 2 r 



and, by analogy, for user B, we get 

where the interference and signal matrices are: 

Hbi 



(6) 













, H B2 — 


hS(2) 


/isi(3)wf 




/i B i(3)w 2 n 



(7) 



1) A Particular Case (MAT Algorithm): We point out that the MAT algorithm [2] can be derived as a 
particular case of the above method, with Wi and w 2 specified as 



Wi = h B (l), w 2 = ^(2). 



(8) 



The key idea behind the original MAT solution in (8) is that the interference s B seen by user A arrives with an 
effective channel matrix H^ 2 which is of rank one, making it possible for user A to combine the three received 
signals in order to retrieve while canceling out s B completely. This process is referred to as alignment of 
interference signal s B , as it mimics the approach taken in interference channels in e.g. [3]. A similar property is 
exploited in (8) at user B as well by making H B1 be rank 1. 
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2) Interpretation of GMAT v.s. MAT: A drawback of the original MAT solution in (8) is to optimize the 
precoders from the point of view of interference alone while the signal matrices 11 ai and H B2 are ignored. Al- 
though this approach is optimal from an information theoretic (multiplexing gain) point of view, it is suboptimal 
at finite SNR. 

In contrast, here, the role of introduced beamformer Wi is to strike a balance between aligning the interference 
channel of at user B and enhancing the detectability of at user A. In algebraic terms this can be interpreted 
as having a compromise between obtaining a rank deficient H B1 and an orthogonal matrix for H A1 . When it 
comes to w 2 , the compromise is between obtaining a rank deficient H^ 2 an d an orthogonal matrix for H B2 - How 
to achieve this trade-off in practice is addressed in Section III. Meanwhile, we show how the above transmission 
protocol can be extended to the 3-user and then the K > 3 user cases. 

It is also important to note there might be alternative fashions of constructing finite SNR precoders based on 
delayed CSIT. For instance, an interesting question is: Can delayed feedback be exploited already in the second 
time slot with gains on the finite SNR performance? The intuitive answer to this question is yes. However, the use 
of precoders in the last time slot only generates a strong symmetry and handling of the users, which in turn allows 
for closed-form and insightful solutions. This symmetric property is also maintained in the MAT algorithm. 



B. GMAT for the 3-user Case 

Similarly to the MAT algorithm, the proposed GMAT sends 18 symbols in a total of three phases, which 



include 6, 3, and 2 time slots, respectively, giving an effective rate of yy symbols/slot. In the first phase, 6 
symbol vectors carrying all 18 symbols are sent in 6 consecutive time slots in a way identical to the initial MAT 



x(l) = s\, x(2) = 4, x(3) = sj,,x(4) = s* , x(5) = s 2 B , x(6) = s 



c 



(9) 



where s,- and sf (i = A, B, C) are 3 x 1 symbol vectors (referred to as the order-1 messages) intended to user-i. 
As in the 2-user case, we do not introduce channel dependent precoding in the first phase in order to preserve 
symmetry across the users. Instead, feedback based precoding is introduced in the second phase. 

Phase-2 involves 3 time slots, in each of which two order-2 messages (defined as a combination of two order-1 
messages) are sent from the first two transmit antennas: 



x(7) 



where the order-2 messages are constructed by 







U AC 




U BC 


U AB 


, x(8) = 


U AC 


, x(9) = 


U BC 
















(10) 



U AB 



W 



1 T„l 
12 



S A + W 



1 T 1 



U AB 



IT 1 2 T 2 

w 12 s A + w 21 s B 



u 



,2 T„2 



,2 T„2 



AC 



(ID 

(12) 
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U 



BC 



1 T„l 



'23 



U 



= w2, T s 2 R + w 



2 T„2 



'23 



(13) 



where u*- and (z 7^ j) are two realizations of the order-2 message dedicated to user-i and user-j, and 



wj 4 G C 3 * 1 , 



w|j G C , 1 < i,j < 3 can be arbitrary vector functions of hj(i), i = A, B, C, t = 1, ■ ■ ■ ,6. 
The responsibility of phase-2 is to provide independent equations with regard to s] (or sf) by utilizing the 
overheard interferences in the previous phase. 

Finally, in the last phase, channel dependent precoding is not introduced as this allows to obtain decoupled 
optimization problems for each of the as will be made in Section III. In this phase, two order-3 messages 
sent at the first transmit antenna within two consecutive time slots, i.e., 



x(10) 









U ABC 




U ABC 





, x(ll) = 














(14) 



where u l ABC (Z = 1, 2) is the order-3 messages which are identical to the original MAT algorithm 

U ABC = a l 

(h cl (7)u\ B + h C2 (7)u\ B ) + a l 2 (h m (8)u\ c + h B2 (8)u AC ) + a l 3 (h A1 (9)u BC + h A2 (9)u 



Be) 



where {a l j} (j = 1, 2, 3) are chosen in a way similar to the original MAT, i.e., arbitrary yet linearly independent 
sets of coefficients and known by both transmitter and receivers. 

Without loss of generality, we treat user A as the target user, and the compact received signal model in matrix 
format over the 1 1 time slots can be given by 

I 2 I 2 I 2 

= v t E + v t E s ^ s * + v t E & A3 s c + «A (15) 
v 6 1=1 v 6 1=1 v 6 1=1 

where the equivalent channel matrix can be formulated as 



H 



Al 



Al 

B l A (2)W[(2) 
D^(3)W<(3) 



H 



A2 



H 



A2 



D^(2)W 2 (2) 
D^(3)W<(3) 



,H 



A3 



H A3 

D^(2)W 3 (2) 
D^(3)W^(3) 



G C 



11x3 



(16) 



where 



-.6x3 



where m[ = (3(1 — 1) + j — 1), n[ = 6 
D^(3) = diag{/i A1 (10), h A1 (U)}, and 



h A (m[ + 1) 
3(1 and D^(2) 



(17) 



diag{h Al (7),h Al (8),h Al (9)}, 



W z (2) 



l T 
12 

J T 



,v 13 

0i x3 



W<(2) 



W 21 

0lx3 

W 23 

W<(2) 



0lx3 

/ T 



W 



W.' 



I T 



32 



W<(2) 



G C 



3x9 



(18) 
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is the global precoding matrix (which is referred to hereafter as the order-2 message generation matrix) and 
W' (2) is corresponding to user-j. 

Given the order-2 message generation matrix W^ (2) G C 3x3 , the precoding matrix for the third phase 
(referred to as order-3 message generation matrix) can be recursively obtained by 



Wj.(3) = C'(2)A'(2)Wj(2) G C 2x3 , j = 1,2,3 
where A ( (2) = diag{hci(7), h B i(8), ^a;(9)} is set identically to MAT for simplicity, and 

C'(2) = 



(19) 



(20) 



W 1 (2) 



(21) 



is a constant matrix known by both transmitter and receivers. 

1) A Particular Case (MAT Algorithm): The original MAT algorithm can be deduced from the proposed 
method by selecting 

h£(l) h T A {2) 1X3 " 
h£(l) 1X3 hj(3) 
^0 1X3 h£(2) h£(3)_ 

where W 2 (2) can be obtained in an analogous way. 

Similarly to the 2-user case, interferences carrying unintended symbols and are aligned perfectly at 
user A, and hence matrices & A2 and & A3 are rank deficient with total rank of 5, making the useful symbol 
retrievable with the left 6-dimensional interference-free subspace. For the proposed GMAT algorithm, we seek 
to balance signal orthogonality (conditioning of H^) and perfect interference alignment by a careful design of 
W'(2). 



C. GMAT for the General K-user Case 

In i^-user case, the maximum achievable DoF is d = „k i [2]. Let d = where T is an integer 

representing the overall required time slots and L is the number of repeated transmission to guarantee T to be 
an integer. Without loss of generality, we assume L = (K — 1)!. The total T times slots can be divided into K 
phases. In phase- 1, there consists of LK time slots. As the same way to the MAT algorithm, an order- 1 messages 
x(t) is sent in i-th time slot, i.e., 

x(*) = s{, I = !,■■■ ,L (22) 

satisfying t = L(l — 1) + i, where s- is the K x 1 symbol vector intended to user-i. 

From phase-2 to phase-iT, the transmission of GMAT is similar to MAT algorithm. Each phase-A; (2 < k < 
K) requires T k = time slots, with each time slot transmitting k order-A; message from k transmit antennas, 
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i.e., 



x(f) 



u k 



(23) 



where u 3 Sk (1 < j < k) is the j-fh message realization of the order-A? message, which can be generated by 

u l Sk =W l (k)s l (24) 

where u l Sk is the Q k x 1 vector (Q^ = (^)) with each element being order-fc message that can be interpreted 
as the combination of any k symbol vectors from {s'} (1 < / < L); S k is the set of dedicated users and satisfies 
|<S fe | = k; s l = [s[ T ■ ■ ■ s l ^] T G C r2x1 is the concatenated symbol vector, and W l (k) € C QkXl<2 is the 
order-A; message generation matrix, whose definition is as follows: 

Definition 1 (Order-A; Message Generation Matrix). The order-k message generation matrix W (k) = 
W'^ifc) • • • Wjf (fe) (2 < k < K) is a Q k x K 2 matrix which satisfies: 

1) it contains k nonzero and K — k zero blocks in each row, where each block is 1 X K row vector; 

2) the positions of nonzero blocks of any two rows are not identical; and 

3) it contains all possibilities of k nonzero positions out of total K positions in each row. 

We point out that the order- k message is desired by those k users whose symbols are contained, and acts as 
interferences that will be overheard by other K — k users. 

Based on the above definition, the signal model of K-user GMAT protocol can be extended as 



(25) 



where 



HMi) 



%{k) 



iTxK 



(26) 



with T = J2?=i Tk, is defined as follows: 

• The first submatrix corresponds to the effective channel matrix in phase- 1 , which can be given by 

Q m [ x K 



H^(l) 







n xK 



G C TlxK 



(27) 



where j = l,...,K,l = 1,...,L, m[ = (K(l-l)+j-l),n[ = KL - K(l - 1) -j, and t = m[ + 1; 
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The A-fh submatrix (2 < A < K — 1) which corresponds to phase-A can be formulated as 

0, 



H' (A:) 



xK 



G C TkXK 



(28) 



D|(A:)Wj(A:) 

k _ 

where m\ = - l) Q k , n l k =T k - \ l -£]Q k with l k = andDj(fe) = diag{M*)} G C«* x «* 

corresponds to the present channel over whom the order- A; message is sent in phase-fc with s = ((I ■ l k ) 
mod L) mod k and t being the index of time slots. In general, (A) (A > 2) is the order-A message 
generation matrix specified to user-j, which is recursively defined according to 



W'(A + 1) = C Z (A)A'(A)W'(A) 



(29) 



where C'(A) G £Qk+ixQ k i s a cons tant matrix known by transmitter and all users, satisfying: (l)eachrow 
contains A + 1 nonzero elements, and (2) the positions of nonzero elements of any two rows are different 
one another; and A' (A) G C QkXQk is a diagonal matrix whose elements are chosen to be a function 
of the channel coefficients in phase-A, so that the interference overheard can be aligned within a limited 
dimensional subspace. For simplicity, we place emphasis on (A), letting A' (A) be predetermined as the 
channel coefficients in phase-A like the original MAT algorithm. 
The last submatrix is corresponding to the last phase, i.e., 



HUK) = T>\(K)W l AK) G C Tk '' 



(30) 



where (K) is defined similarly to (29), in which C l (K — l) G C Tk xQk ~ 1 is a full rank constant matrix 
without zero elements, andD' (K) = dmg{hn(t)} G C TkxTk contains channel coefficients during phase- 
K. 

For further illustration, we take the 4-user case for example to show its order-2 message generation matrix, 
i.e., 



W'(2) 



W 12 


W 21 








W 13 





W 31 





W 14 








W 41 





W 23 


•or 1 T 
W 32 








W 24 





W 42 








W 34 


W 43 



(31) 



where w', G C Kxl 
j 1 



is the beamforming vector aiming at the compromise between user-i and user-j. This 
formulation collapses to (1 1)-(13) for the 3-user case and to (3) for the 2-user case. 
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1) A particular Case (MAT Algorithm): Particularly for the 4-user case, the original MAT algorithm is a 
specialized GMAT algorithm by setting order-2 message generation matrix as 



W 1 (2) 



h|(l) hj(2) 



h^(3) 








h£(l) 












hS(2) 



(32) 







h£(2) h£(3) 

hi (4) 
h£(3) h£(4)_ 

for Z = 1 and similarly for other I. For example, for user A, the interference channels • (j ^ 1) are perfectly 
aligned, leaving K = 4 interference free dimensions for desired signal, and therefore making the intended 
symbols retrievable at user A. Similarly for other users, all symbols can be recovered. Hence, 96 symbols are 
delivered within 50 time slots, providing the sum DoF of ||. 

It is worth noting that the higher level messages can be delivered by the combination of lower lever messages. 
For example, from phase k to K, the message delivered to the receivers aiming at completely decoding the 
order- k message. To avoid too many parameters being optimized which requires huge complexity, we will focus 
merely on the design of the order-2 message generation matrices {W' (2)}. 

III. GMAT Optimization Design 

The computation of {Wj(2)} can use several options. Two of them are briefly described in the following 
sections. The first is based on the optimization of a virtual MMSE metric, yielding an iterative solution, while the 
second one considers the maximization of an approximation of the mutual information, yielding suboptimal yet 
closed-form solutions. Note that none of these approaches have anything in common with finite SNR interference 
alignment methods with non-delayed CSIT, such as, e.g., [11-13], since the nature of our problem is conditioned 
by the delayed CSIT scenario. 



A. Virtual MMSE Metric 

In the following, we describe an approach based on a virtual MMSE metric (referred to later as "GMAT- 
MMSE") for the 2-user case, and subsequently generalize it to the If -user case. 

1) Special K = 2 Case: Since the transmitter does not know hi (3) at slot-3, the optimization of the 
precoder in (5) and (7) cannot involve such information. Fortunately, we point out that the trade-off between 
interference alignment and signal matrix orthogonalization presented above can be formulated in a way that is 
fully independent of hi (3). To do so, we introduce the virtual received signal y s given below, where hi (3) is 
ignored (deterministic fading is assumed over the third time slot): 



<P P 

— n. a s A + W — Hi 2 s B 



A, B 



(33) 
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where the virtual channel matrices are now modified from (5) and (7) by simply setting hn (3) = 1: 

H 2l = , H j2 = hf(2) , i = A,B. (34) 

T T 

Given Wi and w 2 , the optimum RX MMSE filters at user-i over this channel are given by 

V, = ^(pHaH^ + pH^Hg + I)- 1 ^ (35) 

where p = (here K = 2), and the corresponding optimal MSEs are 

J,( Wl ,w 2 ) = Tr(l-pH£(pH il H£ + pH i2 H* + I)- 1 H 4l ) (36) 
Hence, the optimal w 1; w 2 can be obtained from the following optimization problem, i.e., 

min J = Ja(wi,w 2 ) + Jb(wi, w 2 ) (37) 

wi,w 2 :||w 1 p + ||w 2 ||2<2 

In practice, the gradient based approaches can be used to perform optimization although the convexity of the 
problem is not guaranteed. 

2) General K-user Case: In phase-/c, the transmitter does not know hj(i) at slot-i, where t = Yli=i + 
1, • • • , X)j=i Ti- Similarly to the 2-user case, the virtual received signal can be generalized as 



where 



H' 



K 



E h ^ + VkE £ H^sJ + m, • = !,■■ ■ 



H'.. T 



0^ xmi W' r (fc) K> 



W^. T (K) 



(38) 



(39) 



whose elements are defined in Section II. 

Similarly, given (2), the optimum MMSE filters for s' at user-z becomes 

/ l k X" 1 



(40) 



where p = ^ is the normalized transmit power, and the corresponding optimal MSEs are 

Ji(W<(2), j = 1, ■ • • , K) = Tr (l - p&» (p£ £ H l H '/ + J ) H * 



(41) 



The optimal solutions of {W'-(2),j = 1, ■ ■ • , K} in the sense of virtual MMSE at receiver side are now 



given by: 



L K 

min J = yyj\{WA2)) 



(42) 
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L K 

s.t. J]^||W;.(2)|| 2 F <KT 2 . (43) 

1 = 1 3=1 

As the above optimization does not lend itself easily to a closed-form solution, we propose an iterative procedure, 
based on the gradient descent of the cost function J, where W^ (2) is iterative updated according to 

Wi(2)[n + l]=W5(2)[n]- i 9^^ (44) 

where n is the iteration index and B is a small step size. The partial derivation is given in the Appendix. 
Nevertheless, to circumvent non-convexity issues, we explore an alternative optimization method below. 



B. Mutual Information Metric 

Here, we propose an approach based on maximizing an approximation of the mutual information, yielding a 
convenient closed-form solution for { W' (2) }. In the following, we will start with the 2-user case to gain insight, 
and then generalize it to the If -user case. 

1 ) Special 2-user Case: Recall that 

y A = ^HaiSa + ^oH^Sb + n A (45) 

where p = (here K = 2), wi and w 2 are functions of hi(j),i = A, B, j = 1,2 and satisfy power constraint 
II w i IP + II w 2 IP < 2. Consequently, the exact mutual information of user A can be calculated by 

I(s A ;y A ) = logdet(l+ (l + pH A2 H^ 2 ) _1 pHAiH^) (46) 
= log det ^1 + p 

where the second line is easily obtained by permuting rows 2 and 3 in and ~H.A2, and the third line by 
the characteristic polynomial equality [14], det (I + pM) = 1 + p Tr(M) + p 2 det(M), where M is a 2 x 2 
Hermitian matrix. By analogy, the mutual information of user B can be given by 

7(s s ;y s ) = log(l + p||h B (2)f + |g|) (49) 

where 

e^wO = (i + P ||M2)|| 2 )H/ui(3)r(||wj 2 + ^ (50) 

A 1 (w 2 ) = (l + />||h A (2)|| 2 )(l + H^i(3)ri|w 2 || 2 )- p 2 |/ iA1 (3)| 2 wfh A (2)h A (2) ff w 2 (51) 
2 (w 2 ) = (1 + p||h B (l)|| 2 ) / 9|/ lsi (3)| 2 (||w 2 || 2 + / 9||w 2 || 2 ||h B (2)|| 2 - pwf h B (2)h B (2) H w 2 ) (52) 
A 2 ( Wl ) = (l + p||h B (l)|| 2 )(l + p|/ iB1 (3)| 2 ||w 1 || 2 )-p 2 |/ lB1 (3)| 2 wfh B (l)h B (l) ff w 1 (53) 





l+llhf (2)|| : 
Ai(w 2 ) J 



|hf(l)|| 2 
/i A1 (3)hf(l) Wl 



h* A1 (3)w»h A (l 
\h A i (3) | 



2 llw II 2 
1 



(47) 
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By imposing a symmetric constraint for power allocation between Wi and w 2 , e.g., ||wi|| 2 = ||w 2 || 2 = 1 for 
simplicity, the sum mutual information can be deduced to 

/(s,;y,) + I(s B ;y B ) = log (l + + log ( 1 + + logC (54) 

V wfR 2 w 2 / V wfQiWi7 



where 



R! = (l + p||h A (2)|| 2 )(l + / 9hi(l)hi ff (l)) (55) 

R 2 = (l + p||h A (l)|| 2 )( 7l I + phi(2)hi ff (2)) (56) 

Q 1 = (l + p||h B (2)|| 2 ) ( 72 I + ph B (l)h s "(l)) (57) 

Q 2 = (l + p||h B (l)|| 2 ) (l + ph B (2)h B ff (2)) (58) 



where 



7 ,.i±f^ + l |W2 l^ 72 .l±M£ + l | Wlf , 

p\h A1 (3)\ 2 p\h B1 (3)\ 2 
C = (l + / >||h A (l)|| 2 )(l + p||h B (2)|| 2 ) (60) 

and h^(j) G C 2xl is the orthogonal channel of h;(j) (i = A, B, j = 1, 2) satisfying 

Mj)hf (j) + ht(j)hi H (j) = IMjWi- (61) 

In the high SNR region, we get a useful approximation of the sum of mutual informations, i.e., 
j( \ , Tf ^ i /wfR lWl wf Q 2 w 2 \ 

/(s A ;y.) + I(s B ;y B ) « log ^ ^ — J + log C (62) 

which can be optimized by separately maximizing the two Rayleigh Quotients, i.e., 

wfRiWi wf (1 + ^(1)11^(1)^ 

max — n— = max — tt-t r~, s, nrrrs (63) 

l|wi|| 2 =i wfQiWi l|wip=i wf (7 2 I + ph^(l)h^(l)) wi 

wfQgWg wf (l + ph B (2)h B g (2))w 2 

max — = max — 5— r— - — , „ ; NN (64) 

||w 2 |p=i w fR 2 w 2 l|w 2 p=i wf (7il + ph^(2)h^(2)) w 2 

Hence, we can obtain the optimal solutions w° p * and w 2 p *, which are given by the dominant generalized 

eigenvectors of the pairs (Ri, Qi) and (Q 2 , R 2 ), respectively. 

Interestingly, the above objective function can be interpreted as dual SINR in a 2-user interference channel. 

Define 

DSINR, = r rrrv 1 L (65) 

wf ( 7l I + ph | L (z)h-^(i))w i 

which is referred to as a regularized SINR in a dual 2-user interference channel with a desired channel Yif and 

interference channel h^ 1 , where i ^ i, and w, is interpreted as a receive filter. Thus, the optimization problem in 

eq-(63) can be equivalently done by maximizing the regularized SINR in the dual MISO interference channels. 

Note that the regularization lies in not only the interference channels but also the desired channels. This solution 

is referred to later as "GMAT-DSINR". 
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2) General K-user Case: Recall that the definition of DSINR in eq-(65) for the 2-user case, where w, is 
determined by the orthogonal channels of itself and also its peer. According to the structure of W'(2) for the 
if -user case, we can follow this approach and design each nonzero submatrices distributively. For each w^, 
the dual interference channel can be constructed by the orthogonal channels between itself hf and its peer . 
Thus, the regularized dual SINR can be formulated as (e.g., I = 1) 

wS,"(i+(iE w i"i(j>;"(j))w; 1 

where G C Kxl is the i-th (when i < j) or (i-l)-th (when i > j) nonzero block of W' (2), h^j) G C KxK 
is one representation of the null space of hj(j') with the same norm 1 , and 

l ji = \\-w l ji \\ 2 + \\h i (j)\\ 2 + l/p (67) 

Accordingly, the optimal can be obtained by distributively optimizing 

max {DSINR' j / i} (68) 

L K 

s.t. EEH W K 2 )IIf < KT 2 . (69) 

1=1 j=l 

where the corresponding solution can be simply obtained by generalized eigenvalue decomposition. By maxi- 
mizing the dual SINR, is preferred to keep aligned along with hj(j) while to be as orthogonal to h fc (j) as 
possible. Consequently, the optimal solution of balances signal orthogonality with interference alignment 
between user-j's and other users' dual orthogonal channels at j-th time slot. 

IV. Discussion 

A. Multiplexing Gain of GMAT 

In the following, we show the GMAT algorithm possesses the same multiplexing gain as original MAT. We 
consider the 2-user case for example. According to equations from (54) to (64), we have 



Elogl max.. 



Elog 



Ml) 



lim^oo ^ " W ": lwfQlWl/ = Hm^^ - ' w ' 1=1E ^ = 1 (70) 



V>->oo i ogp " " " iogp 



i; n , .... ... ~2 R 2^2j _ - w 2 iih^cajii _ i (7]) 

nnV^oo logp — miip^oo losp —i (iL) 

Thus, together with the fact that lim^oo = 2, the multiplexing gain can be achieved with 

r Emax|| Wl || a= i,|| W2 ||2 =1 (/(s A ;y A ) + I(s B ;y B )) 4 
MGgmat = lim - — - - — !L - — = - (72) 

P^oo 3 lOg p d 

which is identical to the original MAT algorithm. Intuitively, at high SNR, the signal orthogonality becomes no 
relevance, thus our solution naturally seeks perfect interference alignment as in MAT. 



'We abuse here the vector notation to represent the corresponding orthogonal channel matrix for the sake of consistence. 
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B. Single-beam MIMO Interference Channel Interpretation 

To understand more clearly the roles of desired signal orthogonality and interference alignment, we transform 
the mutual information equality (54) into another form, and further interpret their relationship from the point 
of view of a two-user single -beam MIMO interference channel. The strong benefit of this interpretation is that 
the problem of computing the precoders lends itself to classical precoding techniques in the MIMO interference 
channel. Based on eq-(54), the sum mutual information equation can be further transformed to 



I( s a;Ya) +I(s B ;y B ) 

q 1 pw^h y4 (l)h^(l)w 1 + a 2 pw^hi(l)hi H (l)w 1 
a\ + /3 3 pwf h A (2)h£ (2)w 2 + 8 4 pw? K(2)hi H (2)w 2 

gipwf h B (2)hg(2)w 2 + ggpwf h£(2)h£ g (2) w 2 
a\ + a 3 pwf h B (l)hf (l) Wl + a 4/ 9wf h^ljhf (l) Wl 



= log^l + 

+ log (1 + 



+ logC 



where 



Pi 



a 2 



l+p||Ml)|| 2 



l + p||h A (2)|| 2 

'"2 = IIU 7TTTT9 > a 3 

p\\^a{1)\\ 2 

l + p||h B (l)|| 2 
) "2 — in 7n\7i9 ) "3 : 



l + p||h B (2)||2'^ HIM2)|| 2 

1 . I! ||2 2 1 

+ ll w 2|| ,cr 2 - 



H^i(3)| 2 || Wl ||2 
1 

H^i(3)| 2 ||w 2 || 2 



, a 4 = a 3 + 1, 
,84 = 83 + 1, 



+ Wi 



(73) 
(74) 

(75) 

(76) 
(77) 
(78) 



1 p|W3)| 2 " z " ' 2 H^i(3)| 2 
According to eq-(74) and eq-(75), the sum mutual information can be treated as that of 2-user MIMO interference 

channels with 2 antennas at each transmitter and receiver, as shown in Fig. 1. Note that w x and w 2 act as the 

transmit beamformers, where the single beam is transmitted from each transmitter. 

Accordingly, the received signal at two receivers can be equivalently expressed as 



yi = y^oHiWiSi + ^/7jH 2 w 2 s 2 + 
y 2 = v / pG 2 w 2 s 2 + y^GiWiSi + n 2 



5ihf(l) 



B 1 

Slh^(l) 



where 

r^rhf(i)i r^hf(2) 

ill — j "-2 — \j 

2 

and the noises are distributed with ri; ~ CAT(0, ^-1), respectively. 

Consequently, the received SINR for two users can be written, respectively, as 

pIlHiWiH 2 pwfHfHiWi 



/fthf(2 



B > 

/Sh^(2) 



(79) 
(80) 

(81) 



SINR! 



SINR 2 



o\ + p||H 2 w 2 | 
p||G 2 w 2 || 2 



o\ + pwf Hf H 2 w 2 
pw 2 ¥ G 2 ¥ G 2 w 2 



(82) 



(83) 



<t| + pllGiWxIl 2 a\ + pwf Gf dwi 
which are identical to those in eq-(74-75). Hence, existing precoder design methods in the two-user single-beam 
MIMO interference channels with perfect CSIT, e.g., [13, 15-18], can be used here in the context of delayed 
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CSIT precoding. Instead of going into details about those solutions, we take the classic MRT and ZF precoders 
here for example, 

wf RT = U max (Hf Hi) , wf« T = U max (G? G 2 ) (84) 

wf F = U min (Gf d) , wf F = U min (Hf H 2 ) (85) 

where U max (-) and U min (-) are the generalized eigenvectors corresponding to the largest and smallest eigen- 
values, respectively. Interestingly, for the first user, it is worth noting that < a 2 and therefore \v^ !RT — > 
h^(l), means perfect orthogonality of desired signal is preferred. On the other hand, q 3 < a 4 , which denotes 
wf F — > h s (l), corresponds to the preference of perfect interference alignment. Our proposed GMAT-MMSE 
and GMAT-DSINR solutions offer a trade-off between them, yielding a better performance in finite SNR regime. 

V. Numerical Results 

The effectiveness of the proposed solutions is evaluated in terms of the sum rate per time slot in bps/Hz over 
a correlated rayleigh fading channel, where the concatenated channel matrix in slot-t can be formulated as 

H(t) = R.y 2 H^)R t 1/2 (86) 

where H w (t) is normalized i.i.d. rayleigh fading channel matrix while R 4 and R r are transmit and receive cor- 
relation matrices with (i, j)-th entry being rj* ^ and rj, 4- - 7 ' [19, 20], respectively, where r t and r r are randomly 
chosen within [0, 1). Note that the users' channel vectors are the rows of H(i). 

The parameters in the simulation are set as follows: maximum 500 gradient-descent iterations for the GMAT- 
MMSE, j3 = 0.01. The performance is averaged over 1000 channel realizations. Recall that the present channel 
coefficients (c.f. T)\ k \ e.g., h A1 (3) and h B i{3) for the 2-user case) are unknown for the transmitter and therefore 
are ignored for precoder design, while they should be taken into account at the receiver for MMSE receive 
filter design. Naturally, such a mismatch would result in performance degradation, but our proposed precoding 
methods are verified to be always effective thanks to the efficient trade-off between interference alignment and 
signal enhancement. 

We show in Fig. 2 for the 2-user case the sum rate comparison with MMSE receiver among GMAT-MMSE 
with the iteratively updated w 1; w 2 , GMAT-DSINR with closed-form solutions in eq-(63-64), and the original 
MAT algorithm with Wi = h B (l),w 2 = h A (2), with the same power constraint ||w 1 || 2 + ||w 2 || 2 < 2forall.In 
Fig. 2, the gap of sum rate between GMAT and MAT illustrates improvement of the GMAT-MMSE and GMAT- 
DSINR algorithms over the initial MAT concept, demonstrating the benefit of the trade-off between interference 
alignment and desired signal orthogonality enhancement. Compared with the original MAT algorithm, the two 
GMAT approaches have gained great improvement at finite SNR and possessed the same slope, which implies 
the same multiplexing gain, at high SNR. Interestingly, the closed-form solution performs as well as the iterative 
one, indicating the effectiveness of the mutual information approximation. 
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In Fig. 3, we present the similar performance comparison for the 3-user cases. The GMAT-MMSE solution 
updates order-2 message generation matrix W^ 2 ^ iteratively, while the original MAT algorithm set it according 
to eq-(21) and the GMAT-DSINR solution is obtained by optimizing eq-(68) and eq-(69). All these methods 
hold the same power allocation. With more transmit antennas and users, the same insights regarding the trade-off 
between signal orthogonality and interference alignment can be always obtained. It is interesting to note that, 
GMAT-DSINR performs as well as GMAT-MMSE, despite the distributed optimization. 



We generalize the concept of precoding over a multi-user MISO channel with delayed CSIT for arbitrary 
number of users case, by proposing a precoder construction algorithm, which achieves the same DoF at infinite 
SNR yet reaches a useful trade-off between interference alignment and signal enhancement at finite SNR. Our 
proposed precoding concept lends itself to a variety of optimization methods, e.g., virtual MMSE and mutual 
information solutions, achieving good compromise between signal orthogonality and interference alignment. 

Appendix 

A. Gradient Descent Parameter for GMAT-MMSE 

Let [Hy TO; „ = e^H' -e„ be the m-th row and n-th column element of H' •. Particularly, 



VI. Conclusion 



[Hy ro ,„ = e£,W<(£0e 



■n 



(87) 








if m <T\ 



< 




ifT 1 + l<m<T 1 +T 2 



(89) 



>«&n£c(t)A'(t) 



*/ £s=i T s + l<m< £* =1 T s when k>3 



(90) 



where 




I 



Q 



(91) 




n 



K-\ 
t=2 
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Note that we abuse vector e m > with various dimensions T k according to the corresponding matrices Wj(fc) for 
the sake of notational simplicity. Then, it follows that 



j 

where 1 < p < T k , 1 < q < K, and we have 

3H 



e^Q'e p efe„ (92) 



9[Wj r (2)] w 

Finally, according to the chain rule of matrix differentiation [21, 22], we have 



QVf (93) 



8M) - rJ(M-) T ^r- -W^Vl <*> 



So, for the i^-user case, the Gaussian descent parameter can be calculated by 

a (j) _ y(M\ T o' (95) 



where 



0W<(2) ^SW<(2) tU 9H W 



5 =,(VPH^£ £ HLH^ + I) (97) 



axil I 

v an y / \ ;=i k=l,kjtj 



where 



/(A, B) = -A fl (AA fl + B) 1 B(AA fl + B) 1 (98) 
g(A,B) = A H (AA fl +B) _1 (B-I) (AA fl + B) _1 (99) 
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Fig. 2: Sum rate vs. SNR for the 2-user case. 
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Fig. 3: Sum rate vs. SNR for the 3-user case. 



